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Recently^, we presented a general theory for calculating the strength and properties of colloidal interactions 
mediated by ligand-receptor bonds (such as those that bind DNA-coated colloids). In this communication, we 
derive a surprisingly simple analytical form for the interaction free energy, which was previously obtainable 
only via a costly numerical thermodynamic integration. As a result, the computational effort to obtain 
potentials of interaction is significantly reduced. Moreover, we can gain insight from this analytic expression 
for the free energy in limiting cases. In particular, the connection of our general theory to other previous 
specialised approaches is now made transparent. This important simplification will significantly broaden the 
scope of our theory. 



We consider a general system of many linkers, such 
as a solution of colloids coated with DNA strands that 
are capped with reactive sticky-ends. At any given time, 
each linker i can bind at most one other distinct linker j , 
with a free energy change AG^ that depends on the poly- 
mer statistics of the linkers (e.g., length, flexibility and 
grafting position). In many cases, including those of ex- 
perimental relevance, the probability that linker i is un- 
bound is approximately independent of whether or not 
any other linker is also unbound. Here, we show that in 
this limit, the free-energy of interaction of the system is 
given by 



Pij 



i<j 



(1) 



where pi is the probability that linker i is unbound and 
Pij is the probability that linkers i and j form a bond. 
Previously^, we showed that these quantities are given 
by the unique physical solution to the following set of 
self-consistent equations: 



Pij =PiPje 0AG », 
Pi =l~Y,Pij- 



(2) 
(3) 



In what follows, we first motivate the free energy ex- 
pression in Eq. ([IJ through a calculation that closely 
resembles that for mixing entropy of solutions and 
gases. This free energy is minimised for the values of 
{pi} and {p^} that solve the self-consistent conditions 
in Eqs. Q and We then show that the free en- 

ergy in Eq. ([I]) is identical to that obtained through the 
costly and numerical thermodynamic integration previ- 
ously proposed. In the discussion section, we compare 
the performance of Eq. ([!]) with that of the thermody- 
namic integration, establish the explicit connection be- 
tween our Eq. (JlJ and a previous treatment of DNA- 
mediated colloid interactions^, and state the analogous 
result to Eq. (IT]) for the mean-field system of plates dis- 
cussed in Ref. 
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Ensemble realisations 



FIG. 1. Two different realisations of an ensemble of two 
copies. Linkers are depicted as straight lines, and bonds are 
shown as filled circles. Although the numbers of bonds formed 
in each ensemble, {Nij}, are equal, the number of copies 



where both i and j are unbound, N- 



differs. 



I. DERIVATION OF THE MAIN RESULT 

We consider here an ensemble of N independent copies 
of the real system. In each copy, a different set of bonds 
forms between the linkers (see Figure [TJ . Let Ni be the 
number of copies where linker i is unbound, and let Nij 
be the number of copies where i and j are bound to each 
other. Conversely, let N-i t -j be the number of copies 
where both i and j are unbound. These quantities are not 
independent: each linker i is either unbound or unbound, 
so 



(4) 



The fraction of copies where i is unbound (fi), where 
i and j are bound to each other (fij), or where they are 
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both unbound follow immediately: 



fi = Ni/N, 
fij = Nij/N, 
i,-S = N-i,-j/N. 



(5) 
(6) 
(7) 



Let Z({Nij}) be the partition function of an ensemble 
under the constraint that each pair of linkers i and j is 
bound to each other in exactly N^ copies. A closed-form 
expression for Z({Nij}) can be constructed recursively, 
by adding each bond one by one. For a given set of 
{Nij}, we need to work out how Z({Nij}) changes upon 
adding one more i-j bond, which we do as follows. We 
call the set of realisations of the ensemble with {Nj} 
bonds the old ensemble, and that of realisations with one 
more i-j bond, the new ensemble. In the old ensemble, 
there are AT_ i: _j copies where an i-j bond can be added. 
In doing this, all the realisations of the new ensemble 
are generated, but not uniquely. For example, given two 
realisations, one with an i—j bond in copy X but not in Y 
or Z, and one with an i—j bond in Y but not in X or Z, the 
same final realisation can be obtained by adding ami — j 
bond to Y in the former and X in the latter. Conversely, 
in the new ensemble, we can generate a realisation of the 
old ensemble in Nij + 1 ways by removing one of the i-j 
bonds. For example, the old realisation with an i-j bond 
in copy X but not Y or Z can be obtained by deleting the 
i — j bond from a new realisation with an i — j bond in X 
and Y but not Z, or from one with an i — j bond in X and 
Z but not Y. Since the number of ways of going from the 
old to the new ensemble is equal to the number of ways 
of going from the new to the old ensemble, it follows that 



= z(-. 
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The value of N-i t -j depends not just on the values of 
{N^} but on the details of how those bonds are dis- 
tributed between copies (see Figure [TJ . To remove this 
complication, we approximate the probability of j be- 
ing unbound as independent of whether or not i is also 
bound. Hence, 



■V/ , « Nfifj 
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(9) 
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Neatly, this approximation allows us to treat iV_ 
a function of only {Nij}. From the discussion above, we 
obtain an expression for the increase in Z({Nj}) upon 
adding one i-j bond to the system: 

Z(--- , Njj + !,-■■) _ e-P^vNjNj 
Z{— ,Nij,---) ~ N(Nij + l) 

= e-^ G »(N-Z k N ik )(N-Z k N jk ) 
N(Nij + 1) 

This recursion relation, and the fact that Z = 1 when no 
bonds form, allows us to write an approximate closed- 
form expression for Z({Nj}), namely 
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Using Stirling's approximation and Eq. (Ill, the free en- 
ergy per copy (3f* tt = -{1/N) In Z({NijJ) is then given 
by 



pfittiifij}) = E fn^Gii + EC 1 - E ln ( : ~ E /«) + E fn + E fk 



(12) 



Treating {/y} as continuous in the range [0, 1], the over- 
all free energy per copy of the ensemble, i^ttj follows 
from a saddle-point approximation: 



N 



hi 



-W/3/a*tt({/^}) 



\i<3 



PflMij}) + OQnN/N) « Pf* att ({fij}), (13) 



where the integration is over all positive values of {fij} 
satisfying ^ • /y < 1 for all i and the values {fij} are 
obtained by minimising the free energy per copy, 



= 0. 



(14) 



I 

When N — > oo, the values {fij} are precisely the average 
values of {fij}. Eq. (14 1 implies that {fij} and {pij} 
obey identical equations (Eq. (pi)), and so are equal: 



Pij. 



(15) 



A. Connection to thermodynamic integration 



The free energy F* tt , defined in Eq. (13), is equal to 



the free energy of the real system to the extent that the 
approximation in Eq. ^ is valid. Since this is the same 
approximation that we used previously^ to calculate the 
free energy in terms of a thermodynamic integral, F att , 
it is reasonable to suppose that F* tt and F at t are equal. 
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We now show this explicitly. 

In our original paper, we calculated the exact attrac- 
tive free energy for the real system of linkers using ther- 
modynamic integration. Specifically, we replaced (3AGij 
by (SAGjj+A, whereupon the probabilities {pi} and {pij} 
become functions of A. We then integrated the appropri- 
ate free energy derivative over the range < A < oo, and 
obtained 



^ att = -/ dA ^A~ 



dA^fti(A). (16) 

i<3 



The same replacement of f3AGij with f3AGij + A can be 
made in the ensemble of N copies. In that case, using 



Eqs. Q and Q, we find that 

dpij 



dA 



i<3 



Of, 
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(17) 



The first term vanishes owing to Eqs. ( 14 ) and ( 15 ), and 



the second term follows immediately from Eq. p2|. We 
then have 



dA 



dA 



(18) 



Moreover, both F* tt and i^tt are zero when A is infinite, 
so the two quantities are equal for all A. 



The previous result, together with Eqs. ( 12 )— ( 15 1, 



yields the following closed-form expression for F a 



pF att = ^pijfiAGij + ^(1 - ^2 Pij) ln(l - ^Pik) + ^2 Pij ln Kj +^Pij- 



l<3 



l<3 



l<3 



This expression is, in fact, equivalent to the much more compact Eq. |l]). Concretely, 



/0-F.tt = ^PijpAGij + 5^(1 - ^Pij) lnjjj +J2,Pij ln(piPje-P AG v)+J2Pi3> 

i<3 i 3 i<3 

* 3 i,3 



i<3 



(19) 
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(20b) 
(20c) 
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II. DISCUSSION 

Figure [2] reports the typical computational speedups 
obtained by using Eq. (II) versus our original thermody- 
namic integral, Eq. ((l6l, for systems of M linkers. The 
speedup is higher for larger M and for stronger bonds 
because each evaluation of the thermodynamic integrand 
involves solving a system of M equations, and the size 
of the integration domain scales linearly with the bond 
strength. Typically, experimentally relevant regimes deal 
with hundreds to tens of thousands of strands, a regime 
that can now be treated exactly with Eq. 0. 

In the limit of weak-bonds, where ptj is close to 0, we 
find that 



l<3 



(3F att = ^2\np l + ^2p lj =^2ln(l- ^ pij) + p^, 

i i<j i j 

i<3 



i<j 



This approximate result has been widely used by pre- 
vious authors^"^ under the name of the "Poisson ap- 



proximation" or the "weak binding regime". However, 
in experiments with micron-sized DNA-coated colloids, 
this approximation can be significantly inaccurate^. At 
the nanoscale, where high bond strengths are commonly 
used, the Poisson approximation is expected to break 
down (see Fig.[3]for comparison). Eq. is instead quan- 
titatively accurate for bonds of any strength^. 

Eqs. 0, p| and also make explicit the connection 
between our theory and previous treatments. For exam- 
ple, Dreyfus et alP model the attraction between two 
DNA-coated spheres by first estimating the maximum 
number N p of linkers on each sphere that could form a 
bond with a linker on the second sphere, and then assum- 
ing that each such linker can independently bind any of 
k linkers with an average free energy AF tcthcr . For later 
convenience, we define a small expansion parameter x as 



(21) 



and note that the experiments in Ref.[2]take place mostly 
in what the author call the "weak-binding regime" , where 

In their model, the partition 



x < 1 and N p x > 1 
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FIG. 2. Co mputational speedups obtained by using Eq. |T]) 
vs. Eq. ( 16 1, for a system of parallel plates coated with com- 



(22) 



plementary linkers. 

function of the system is 

Z« (l+x) N *, 

from which the free energy F at t follows P 

0-Fatt « - ln[(l + x) N *] = -N p ln(l + x) (23) 
^-N p (x-x 2 /2 + ---). (24) 

In the present framework, which does not treat the link- 
ers as binding independently, every linker has the same 
probability p of being bound, given by the solution to 



1 y/T+Ix - 1 

^p = 



1 + xp 2x 
The free energy then follows from Eqs. 0, j2j: 

/3^att =iV 2 x + 2iV p mp 
w -A p (:r-:E 2 + ---) 



(25) 



(26) 
(27) 



Thus, our theory recovers the results of Dreyfus et al. in 
the weak binding regime. However, there is significant 
disagreement already at second order in a;, where linkers 
begin to compete for binding partners. 

Finally, using the same procedure as in our original pa- 
per, we can directly write an attractive free energy den- 
sity for a pair of plates, treated at a more approximate, 
spatial mean-field level. In the notation of Ref. [TJ 



/3/att = -X ^ a aPaK al 3Pf3(Jp + ^ (7 a \np a - 



(28) 



a,/3 



This result also follows from a large-area limit of Eq. ( 19 ) 
with random grafting points 1™. We expect the simplifi- 
cation provided in this communication will boost the use 
of our model for calculating interactions free-energy for 
general ligand-receptor mediated systems. 
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FIG. 3. Free energy per linker in the "Poisson approx- 
imation" and the present model. Higher values of x = 
k exp(— /3AFtother) lead to higher bonding probabilities, ei- 
ther because bonds are stronger or because linkers have more 
binding partners. The two models agree in the "weak bind- 
ing regime" (i < 1), but disagree when correlations between 
neighbouring strands become significant. 
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